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Abstract. In this paper we explore the idea that Teichmiiller space with the 
Teichmuller metric is hyperbolic "on average." We consider several different 
measures on Teichmuller space and show that with respect to each one, the 
average distance between points in a ball of radius r is asymptotic to 2r, which 
is as large as possible. 



Let S be a closed surface of genus g > 1. In this paper we continue the study of 
Teichmuller space T(S), which is the parameter space for several types of geometric 
structures on S. It is known that Teichmuller space equipped with the Teichmuller 
metric dj- is a complete metric space homeomorphic to IR 65 ~ 6 . It is not 5-hyperbolic 
[IB] , and several kinds of obstructions to hyperbolicity are known: for instance, pairs 
of geodesic rays through the same point may fellow-travel arbitrarily far apart [10] , 
and there are large "thin parts" of the space which, up to bounded additive error, 
are isometric to product spaces equipped with sup metrics [17] (and therefore not 
hyperbolic). These exceptions to the negative-curvature phenomena seem to come 
from rare occurrences, so one might expect properties that are characteristic of 
hyperbolicity to hold on average. 

One way to make this precise is to consider the average distance between points 
on metric spheres in a metric space (X, d) . Writing S r (x) for the sphere of radius r 
based at x, we define a geometric statistic for the large spheres as follows. Given a 
family of probability measures fx r on the spheres S r (x), let E(X) = E{X, x, d, {fi r }) 
be the average distance between points on large spheres: 



if the limit exists. It is shown in [6] that non-elementary hyperbolic groups all have 
E(G, S) = 2 for any finite generating set S; this is also the case in the hyperbolic 
space M. n of any dimension endowed with the natural measure on spheres. By 
contrast, it is shown that E(W l ) < 2 (increasing over the range [ 4 /?r, y/2) as the 
dimension goes from 2 to infinity), and that E(Z n ,S) < 2 for all n and S, with 
nontrivial dependence on S. (See for more examples.) 

In the case of Teichmuller space, one could consider a number of a priori dif- 
ferent measures that are natural from various points of view. For instance, as a 
metric space T(S) carries a (6g — 6)-dimensional Hausdorff measure r\. Other mea- 
sures come from the Finsler structure (Busemann measure and Holmes-Thompson 
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measure), from the holonomy coordinates on the cotangent bundle (holonomy, or 
Masur-Veech, measure), and from the symplectic structure. In Sj3l we find that 
all of these are absolutely continuous with respect to each other, and in fact are 
bounded in terms of each other. Furthermore, the sphere S r (x) can be identified 
with the unit sphere Q}(x) in the vector space of quadratic differentials on x via 
the Teichmuller map. The latter has various natural measures, and corresponding 
measures on S r (x) will be called visual measures; we will pay special attention to 
two standard visual measures, denoted Vis^) and Vis^). 

Our main theorem concerns the average distance between points in the ball 
B r (x) of radius r centered at x. We seek to show that the average distance between 
points in B r {x) is asymptotic to 2r, which, in light of the triangle inequality, is 
the maximum possible distance. Two axioms are needed for this conclusion: a 
measure must satisfy a thickness estimate, guaranteeing that typical rays spend 
a definite proportion of their time in the thick part, and an exponential decay 
estimate, asserting that fellow-traveling becomes exponentially rare. All of the 
measures discussed above satisfy these axioms. 

Theorem 1. Suppose /i satisfies a thickness estimate and an exponential decay 
estimate. Then for every point x £ T(S), 



With respect to the standard visual measures, the same methods produce the 
following result. 

Theorem 2. E(T(S)) = 2 with respect to Teichmuller distance and the standard 
visual measures. 

We sketch here the main ideas in the proofs. The first step is to show ( §5.2[) that 
most pairs of geodesies separate from each other in the Teichmuller metric after a 
threshold time. Then one would hope that, as in a hyperbolic space, the geodesic 
joining their endpoints would follow the first geodesic back to approximately where 
they separate before following the other so that its length is roughly the sum of 
the lengths of the two geodesies, as on the left in Figure [TJ The Minsky product 



Figure 1. We will show that the geodesic between points on 
generic rays "dips" back near the basepoint. This requires an anal- 
ysis of the time spent in thick and thin parts: if [x, y'] and [x, z'\ lie 
in thin parts corresponding to disjoint subsurfaces, then Minsky's 
product regions theorem shows that the connecting geodesic can 
take a "shortcut." We show that this effect is rare. 
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regions theorem [T7] says that this in fact may not happen. If the pair of geodesies 
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separate because they enter thin parts corresponding to disjoint subsurfaces, then 
the geodesic joining their endpoints travels through those thin regions simultane- 
ously and its length is then smaller than the sum, as on the right in Figure [TJ Our 
goal is to show that this phenomenon does not happen on average. The mechanism 
for showing this is the coarsely contracting map from T(S) to the curve complex. 
Using thickness statistics for the rays, we show (Theorem |40|) that most pairs of 
Teichmuller geodesies separate in the curve complex after a bounded time, which 
rules out shortcuts through product regions. We then show (Theorem BT|) that for 
most pairs, a geodesic connecting the two rays must pass through the region where 
the rays separated, which immediately provides the needed bounds. 

Along the way we use a wide array of tools in Teichmuller geometry, including 
some that are quite recently developed: subsurface projection and reverse trian- 
gle inequalities (following Masur-Minsky and Ran) , time-ordering of thin intervals 
along Teichmuller geodesies (Ran), antichain bounds (Rafi and Schleimer); vol- 
ume asymptotics in Teichmuller space (Athreya-Bufetov-Eskin-Mirzakhani) and 
random walks as discretized Teichmuller geodesies (Eskin-Mirzakhani) . We also de- 
velop several facts that can serve as future tools, including a simplified formula for 
Teichmuller distance ( §2.4|1 . comparisons of measures O, and the results showing 
that thickness estimates for rays ensure that a typical ray makes definite progress in 
the curve complex and that typical pairs obtain big distance in the curve complex 
and big distance in Teichmuller space f ^6.HI6.2ll6.3p . 

1.1. Acknowledgments. We would like to thank Alex Eskin, Benson Farb, Curtis 
McMullen, and especially Kasra Rafi for numerous helpful comments and explana- 
tions. 



2.1. Teichmuller space and quadratic differentials. Recall that Teichmuller 
space T(S) is the space of marked Riemann surfaces X that are homeomorphic to 
the topological surface S. More precisely, it consists of pairs (A, /), where / : S — > 
A is a homeomorphism, up to the equivalence relation that (Ai,/i) ~ (A2, f%) 
when there exists a conformal map F : X\ — > X2 such that F o f 1 is isotopic to /2. 
Alternately, we may define T(S) as the space of marked hyperbolic surfaces (p, /); 
namely, maps / : S — > p with (pi, fi) ~ (pi^f-i) when there exists an isomctry 
F : pi — > P2 s.t. Fo/j is isotopic to /2- 

Using the first definition of T(S), the Teichmuller distance is given by 



where the minimum is taken over all quasiconformal maps F and K(F) is the 
maximal dilatation of F. The space T(S) is homeomorphic to the ball IR 69-6 , and 
from now on we will use h = 6g — 6 to designate this dimension. In this paper, 
we will denote a point of T(S) by x, regarding it either as a Riemann surface or a 
hyperbolic surface, and suppressing the marking /. 

For 7~(S), the Teichmuller geodesic segment joining x to y will usually be 

denoted [x,y]. We will also use j(t) to denote a geodesic ray or segment when the 
time parameter is important. 

A quadratic differential on a Riemann surface A is a holomorphic 2-tensor q = 
<fi(z)dz 2 on X. The space of all quadratic differentials on all Riemann surfaces 
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homeomorphic to S is denoted Q(S). A point of Q(S) will be denoted q, with 
the underlying complex structure implicit in the notation. The real dimension of 
Q(S) is 12g — 12 = 2h. Reading off the Riemann surface, we obtain a projection 
to the Teichmuller space n : Q(S) —> T(S). Under this projection, Q(S) forms 
vector bundle over T(S) which is canonically identified with the cotangent bundle 
of T(S). Each fiber Q(X) is equipped with a norm given by the total area of q; 
namely ||q|| = J x \4>(z)dz 2 \. Recall that df is not a Riemannian metric on T(S), 
but rather a Finsler metric; it comes from dualizing the norm on Q to give a norm 
on each tangent space of T(S) that is not induced by any inner product. 

It is the famous theorem of Teichmuller that the infimum in the definition of dj- 
is realized uniquely by a Teichmuller map from X\ to X2- A Teichmuller map is 
determined by an initial quadratic differential q — <j)(z)dz 2 on X\ and the number 
K . The Teichmuller map expands along the horizontal trajectories of q by a factor 
of K x l 2 and contracts along the vertical trajectories by the same factor to obtain 
a terminal quadratic differential q' on the image surface Xi. If we fix q and let 
K = e 2t vary over t £ [0, oo) we get a Teichmuller geodesic ray. 

Recall that the mapping class group of S 1 , defined by 

Mod(S) := Diff + (S)/Diff (S'), 

is the discrete group of orientation-preserving diffeomorphisms of S, up to isotopy. 
This group acts isometrically on T(S) by changing the marking: <j> ■ (X, f) = 
(X,f o (f)^ 1 ). In fact, by a result of Royden [21], Mod(S) is the full group of 
(orientation-preserving) isometries of (T(S),df). 

2.2. Curve graph. When we speak of a curve on S, this will mean an isotopy 
class of essential simple closed curves. Given x <E T(S), the length l x (a) of a curve 
a is the length of the geodesic in the isotopy class in the hyperbolic metric x. 

We recall the definition of the curve complex (or curve graph) C(S) of S. The 
vertices of C(S) are curves on S. Two vertices are joined by an edge if the cor- 
responding curves can be realized disjointly. Assigning edges to have length 1 we 
have a metric graph. Properly speaking, C(S) is the flag complex associated to 
this curve graph, but since we are working coarsely, we can identify C(S) with the 
graph. 

It is known that the curve graph is hyperbolic [15]. That is, there exists a 
constant S > such that every geodesic triangle in C(S) is 5-thin: each side of 
the triangle is contained in the union of the ^-neighborhoods of the other two 
sides. It follows that every geodesic quadrilateral in C(S) is 2i5-thin (each side is 
within 25 of the union of the other three sides). Furthermore, in any (5-hyperbolic 
metric space and for any quasi-isometry constants (K,C), there exists a constant 
r, depending only on S, K, C, such that any two (K, C)-quasi-geodesic segments 
with the same endpoints remain within r of each other. Since actual geodesies are 
(1, 0)-quasi-geodesics, this implies that every (K, C)-quasi-geodesic quadrilateral 
is 2(6 + r)-thin. 

2.3. Thick parts and subsurface projections. For any given eo, we say a curve 
is eo-short if its hyperbolic length is less than eo- Then define 7^ , the t^-thick part 
of Teichmuller space, to be the subset of x € T{S) on which no curve is eo-short. 

For each x e T(S) there is associated a Bers marking fi x . To construct fi x , 
greedily choose a shortest pants decomposition of the surface (a collection of 3g — 3 
disjoint simple geodesies). Then for each pants curve (3, choose a shortest geodesic 
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crossing (3 minimally (either once or twice depending on the topology) that is 
disjoint from all other pants curves. The total collection of 6g — 6 curves is called 
a Bers marking and is defined up to finitely many choices. 

Throughout, a proper subsurface of S will mean a compact, properly embedded 
subsurface V C S which is not equal to S and for which the induced map on 
fundamental groups is injective. Subsurfaces which are isotopic to each other will 
not be considered distinct. The proper subsurfaces of S fall into two categories, 
annuli and non-annuli, which behave somewhat differently. Nevertheless, we will 
strive to develop intuitive notation under which these two possibilities may be dealt 
with on equal footing. 

Every proper subsurface V has a nonempty boundary dV consisting of a disjoint 
union of curves on S. We say that two subsurfaces V and W transversely intersect, 
denoted V fti W, if they arc neither (isotopically) disjoint nor nested. In this case, 
dV necessarily intersects W, and dW intersects V. 

Consider a non-annular subsurface V, possibly equal to S. The subsurface pro- 
jection nv(f3) of a simple closed curve (3 C S to V is defined as follows: Realize 
(3 and dV as geodesies (in any hyperbolic metric on S). If f3 C V, then ttv(P) is 
defined to be [3. If (3 is disjoint from V, then irv(P) is undefined. Otherwise, [3 n V 
is a disjoint union of finitely many homotopy classes of arcs with endpoints on dV, 
and we obtain 7Ty(/3) by choosing any arc and performing a surgery along dV to 
create a simple closed curve contained in V. The subsurface projection of a point 
x G T{S) is then defined to be the collection 

ir v (x) := {TT V (f3)} 0e ^ x 

of curves obtained by varying (3 in the Bers marking at x. This is a non-empty 
subset of the curve complex C(V) with uniformly bounded diameter. 

Definition 3 (Non-annular projection distance). For a non-annular subsurface 
V <Z S, the projection distance in V of a pair of points x, y G T{S) is defined to be 

d v (x,y) := diam c(y) (7r v (a;) U n v (y)). 

In particular, ds(x,y) denotes the curve complex distance. When convenient, we 
will also denote this distance by dc(v) '■— dy- 

For an annular subsurface A C S with core curve a = OA, there are two kinds 
of projection distances: one that measures twisting about a and is analogous to 
the definition above, and a second which also incorporates the length of a. Any 
simple closed curve [3 that crosses a may be realized by a geodesic and then lifted 
to a geodesic (3 in the annular cover A, that is, the quotient of H 2 by the deck 
transformation corresponding to a, with the Gromov compactification. For a pair 
(3, 7 of such curves, we may then consider the intersection number i(/3, 7) in A. The 
twisting distance in A of a pair of points x, y G T(S) is then defined as 

d C (A){x,y) ■= sup ^03,7)- 

We additionally define a hyperbolic projection distance as follows. 

Definition 4 (Annular projection distance). For an annular subsurface A C S with 
core curve a = OA, denote by M a a copy of the standard horoball {Im(z) > 1} C H 2 . 
Given x, y G T(S), we consider the points (0, l/l x (a)) and (dc(A)( x , y), l/l y (a)) G 
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H 2 and denote their closest point projections to the horoball H Q by 
TTa(%) = I 0, max < 1, 



L >7^)})> 7r a (») = (4 C (A,(*,y),inax|l,^ 



a) 

The projection distance in A (or hyperbolic distance du a ) between x and y is then 
defined to be 

d A (x,y) := d W 2 (n a (x),n a (y)) . 
2.4. Distance formula. For functions /, g and constants K > 1, C > 0, we will 

K C 

use the notation f(x) x g(x) if the inequalities j^g{x) — C< f(x) < K ■ g{x) + C 
hold for all x. As usual we denote / = O(g) if the second inequality holds, and 
/ = o(g) if f(x)/g(x) —> as x — > oo. 

The following distance formula due to Rati relates the Teichmiiller distance be- 
tween two points x and y to the combinatorics of the corresponding Bers markings 
fi x and fly. 

Theorem 5 (Distance formula, Ran 18 ). Fix a small e > 0. For any sufficiently 
large threshold M , there exist quasi-isometry constants K > 1 andC > depending 
only on M and the topology of S such that, for all x, y £ T{S) we have 



dr(x, y) K ~ d s (x, y)+^2 i d v(x, y)] Mo + max d Ma {x, y) 

where T xy is the set of ea -short curves in both x and y, T x is the set of curves that 
are eq-short in x but not in y, and T v is defined similarly. Here and throughout, 
log + is a modified logarithm so that log + a = for a € [0, 1], and [-] M is a threshold 
function for which [N] M :— N when N > Mq and [N] M :— otherwise. 

We will make all annular measurements with the hyperbolic distance on the H a , 
obtaining a particularly simple restatement of this formula. 

Proposition 6 (Repackaged distance formula). With the same hypotheses as above, 
we have: 



(1) d T (x,y) ^ ds(x,y) +^T[dY(x,y)] 



M 



Here, the sum is over all (annular and non-annular) proper subsurfaces. 

Remark 7. The definition of e?e Q = &a given above is technically different than 
that used by Rah in |18j ; however, the two definitions agree up to bounded additive 
error. 

This version of the distance formula treats annular and non-annular subsurfaces 
on equal footing. For simplicity and without loss of generality, below we suppose 
that eo is fixed small enough that log ,(l/eo) > 100, say. We begin with a straight- 
forward reformulation. 
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Lemma 8. Given any sufficiently large threshold Mo, there exist K > 1 and C > 
such that for all x, y G T(S) we have: 

dr(x, y) K ^ d s (x, y) + Y l d v(x, y)} Mo + E \ d A& v)\ Mo + 
v A-.dAer xy 

£ [max{log + (d c(A) (x, y )) -log + (^) 



A:dA£T x 



log M 



Proof. Since T xy , T x and T y each contain at most 3gr — 3 curves, each max over 
these sets is within bounded multiplicative error of the corresponding sum, and 
applying a threshold only creates bounded additive error, so the first three terms 
of the lemma are established. By the definition of T x we have 

1 



l x {a) 



Since this is a sum with at most 3c? — 3 nonzero terms, we can increase the threshold 
to any number M > 1/eo with bounded additive error. Finally, for functions /, g, h, 
we have 

3 

lo S+ LfVo + lo S+ Mmo + lo S+ Wmo ~ [max{log + /, log +5 , \og + h}] log Mq . □ 

We now show that each term in the last summand is bilipschitz equivalent to 
the corresponding hyperbolic distance d A {x,y). 

Lemma 9. Consider an annular subsurface A C S with core curve dA = a. For 
each pair of points x, y € T(S), set 

H A (x, y) := max |log + (d c{A) (x, y)) , log + (^) , log + (j^j } . 

If x,y G T{S) are such that a ^ T xy and either d A (x, y) or H A [x, y) is greater than 
361og + (l/e ), then 6~ 1 d A (x, y) < H A (x,y) < M A {x,y). 

Proof. Choose points x,y € T(S) that satisfy the hypotheses. To fix notation, set 
n 'a( x ) = (0' 1) an d ^aiu) = (dc(A)( x , y), 1). These are the closest-point projections 
of ir a (x) and TT a (y) to the horocycle bounding H Q , and their distances from these 
points are exactly given by log + (l /l x (a)) and \og + (l/l y (a)). Let 

B = d n 2(Tr' a (x),n' a (y)) = arccosh ^1 + dc ( A ^ ,y ^ 

denote the hyperbolic distance between these projections. Using this formula, one 
may easily check that the inequalities 

(2) \og + d c{A) (x, y) < B < 4 log + d c(A) (x, y) 

hold provided that either B > 3 or dc( A )(x, y) > 3. 

Applying the triangle inequality with the points Tr' a (x) and n' a (y) implies that 



(3) d A (x,y)<log + — - + B + log 

Then (O, (0]), and the definition of H A imply that d A (x, y) < 6H A (x, y) in the case 
that B > 3. If B < 3, we claim that the hypotheses of the Lemma ensure that 
B cannot be the largest term on the right-hand side and therefore that d A (x,y) < 
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3L < 3Ha(x, y), where L denotes the larger of the other two terms. Indeed, if B 
were the largest term and B < 3, then (|3|) would imply (La{x, y) < 9, and ([2]) would 
necessitate \og + dc(A)( x , y) < 3 so that Ha{x,v) < 3. But then both d,A and Ha 
are less than 9, contradicting the hypothesis. 

By the above, the assumption d,A{x,y) > 361og + (l/eo) implies that HA(x,y) > 
61og + (l/eo); therefore all cases will be covered by proving that this in turn im- 
plies HA(x,y) < 6dA(x,y). Without loss of generality, we may assume that 
£x(gO < ly( a )i since a (£ T xy this guarantees l y (a) > eo- First suppose that 
log + d C (A)(x,y) > 31og + (l/^(a)), inwhichcasewehavelog + d c(y!l )(a:,y) = H A {x,y) > 
61og + (l/eo). In particular we certainly have dc(A)(x,y) > 3; thus ^fy and the tri- 
angle inequality give 

\o g+ d c{A) (x, y)<B< log + (Jl^I + d A (x, y) + log + . 

Therefore HA(x,y) = log + dc( J 4)(a;, < 3dA(x,y) in this case. The remaining pos- 
sibility \og + d C ( A ){x,y) < 31og + (l/Z x (a)) necessitates 31og + (l/Z x (a)) > H A {x,y). 
Recall that Tt' a {x) is the closest point projection of n a (x) to the horocycle bounding 
Hq,; since Tr' a (y) is also on this horocycle we have 

lo S+( 77-T J - d vp(ir a {x),n' a (y)) < d A (x,y) + log 



The assumptions 3 log + (l/^(a)) > HA(x,y) > 61og + (l/e ) and Zj,(a<) > eo now 
ensure that HA{x,y) < QdA{x,y). □ 

Corollary 10. Let HA(x,y) be defined as in Lemma® Then for any threshold 
Mo > 361og + (l/eo) and any x, y € T(S) we have 

E [d A (x,y)] 6Mo < E [HA{x,y)] Mo < E Q i d A{x,y)] Mo/6 

With these estimates, we can derive the simplified distance formula. 

Proof of Repackaged Distance Formula. Fix a small eo > and choose any suffi- 
ciently large threshold Mo such that Lemma [5] holds for both e 6M ° and -Mo/6 and 
such that M /6 > 361og + (l/eo). Let K > 1 and C > denote the larger of the 
quasi-isometry constants given by Lemma [8] for the thresholds e 6M ° and Mo/6. 

Notice that, in any sum of the form [f]w ra i sm g the threshold can only 
decrease the value of the sum, and lowering the threshold can only increase its value. 
Therefore, combining Lemma [5] and Corollary 1101 we find that for any x,y € T{S) 
the various distances satisfy 

d T < K Ids + E + E [ddefixo + E I h a] 6 m„ + C 

\ V dAer^y dA^T^y J 

<eK[d 8 + Y,[d v ] Mo + E [^U+ E i d A] Mo )+c, 

V V 9Aer x „ dA£r xv ] 



where we have suppressed the x and y in the notation. The lower bound on dq-(x, y) 
is similar. □ 
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2.5. Thinness and time-ordering. We will use some results from Rafi's work 
[18\ Prop 3.7] combinatorializing the Teichmuller metric. For every Teichmiiller 
geodesic and every proper subsurface V, there is a (possibly empty) interval along 
the geodesic where dV is short. Outside of this interval, the projections dy move 
by at most a bounded amount. In the form that we will use below: there is a global 
constant M and constants eo < £i such that for any pair of points x, y £ T(S) there 
is a possibly nonempty connected interval Ty along the geodesic segment [x, y] such 
that 

• for a £ TV the length of dV on a is at most ei; 

• for a £ [x, y] \ Ty the length of dV on a is at least eo; 

• for a, b in the same component of [x, y] \ Ty, we have dy(a, b) < M; and 

• if V rh W then Ty n T w = 0. 

We should note that the interval Ty is not uniquely defined. We also note that 
the second condition says that on the complement of the union of thin intervals the 
point lies in the eo-thick part of T(S). If Ty ^ we will say that V is thin along 
Ty. In particular if dy(x, y) > M, then the interval Ty ^ 0. 

We will write Ty < T^y along [x, y] if both endpoints of Ty occur before both 
endpoints of T\y when traveling from x to y. This Ty is called the thin interval 
for V (or the active interval in some papers). Note that for us thin intervals are 
segments in Teichmuller space, whereas Rafi works with the corresponding time 
intervals Iy C R. The geodesic [x, y] is suppressed in the notation Ty, and so 
the same notation re-occurs when there are multiple segments in an argument; the 
geodesic with respect to which the interval is defined should be clear from context. 
We will take M to be large enough to be a valid threshold in the distance formula 

The properties of thin intervals imply the following time-ordering principle. 

Lemma 11 (Time ordering [19]). Choose any constant Mo > M. Consider a 
pair of geodesic segments in T with a common basepoint x which end in y, y' , 
respectively. Suppose V rtl W are transversely intersecting subsurfaces of S. 

(1) If dy{x,y),dy{x,y') 1 dw{x,y),dw{x,y l ) > 3Mq, then the thin intervals 
Ty , T(y appear in the same order along both geodesies, as they are traced 
out from x . 

(2) If dy{x,y),dw{x,y),dw{x,y') > 2>Mq and Ty appears before Tw along 
[x,y], then V determines a thin interval along [x,y'] which appears before 
T w . 

Proof. Assume the pairs of endpoints have large projection to V and W as in the 
hypothesis of the first statement, and suppose Ty appears before T^y along [x, j/]. 
Since the endpoints of Ty contain dV in their markings, we have dw{%> dV) < Mq. 
If the intervals appear in the opposite order along [x, y'], then letting z be the 
endpoint of Ty closest to x, since z contains dV in its marking, we use the triangle 
inequality to get 

d w (x, dV) = d w (x, z) > 3M - M = 2M , 

a contradiction. This proves the first statement. Turning to the second statement, 
the assumption on [x, y] gives us that 

d v (x,dW) > 2M . 
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Let z' be the endpoint of TV along [x, y'} closest to i. It contains dW in its 
marking. We therefore have 

d v (x,z') > 2M > M, 
and so Ty must appear between x and z' along [x, y']. □ 

2.6. Reverse triangle inequality. We will repeatedly use the fact that the pro- 
jection of a Teichmuller geodesic to the curve complex of any subsurface forms 
an unparameterized quasi-geodesic that, in particular, does not backtrack. This 
phenomenon is captured by the following "reverse triangle inequality," which was 
proved first in the case of the curve complex of the whole surface by Masur-Minsky 
[T5] and then for general subsurfaces by Rafi [TH1 Thm B] . 

Lemma 12 (Reverse triangle inequality). There exists B > such that for any 
nonannular subsurface V (including S itself) and for any geodesic interval [x,y] 
and any subinterval [a, b] C [x, y] we have 

(4) dy(x, a) + d v (a, y) < d v (x, y) + B, and 

dv(a, b) < dy(x, y) + B. 

For an annulus A, these inequalities hold with the twisting distance dc(A)> but not 
necessarily with the projection distance dA- 

In the exceptional annulus case, we have the following theorem from Rafi [18] . 

Theorem 13 (R.T.I, exception). For any sufficiently large Mq, there exists a con- 
stant B' > with the following property. For any geodesic segment [x, y] and any 
annulus A, if a £ [x,y] is such that dA{x,a) + dAifl,y) — dA(x,y) > B' (i.e., 
the reverse triangle inequality fails), then there exists a proper subsurface V =/= A 
containing a family of subsurfaces Wi C V such that 

• dV = dA, 

• the W t fill V, 

• for each Wi the reverse triangle inequality Q holds along [x, y], 

• dwt (a, y) > Mq for each i, 

• d A (a,y) < J2 d Wi(a,y)- 

We remark that this is not exactly how the result in [TB] is stated. Rafi finds 
an annulus about the short curve which with respect to the quadratic differential 
is a disjoint union of a flat annulus and an expanding annulus. Each is foliated by 
equidistant lines. In the flat annulus case, the lines are geodesies of the quadratic 
differential and have curvature. In the latter case they have negative curvature. 
Rafi measures the path traveled in H 2 defined by the length and twist coordinates 
by computing the modulus of these annuli. He shows that the distance traveled in 
H due to the expanding annulus is much smaller than the Teichmuller distance 
and if the reverse triangle inequality fails, it is due to the presence of an expanding 
annulus whose modulus is much bigger than the modulus of the flat annulus. The 
fact that path length in H 2 is much smaller than Teichmuller length forces, by 
his distance formula, the presence of the domains Wi as in the statement of the 
theorem. 

Going forward, we fix once and for all a constant M large enough to satisfy 
the quantitative parts of the thinness statements, the distance formula, and these 
reverse triangle inequality statements. 
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3. Comparing measures 

To address averaging questions, one of course needs to consider a measure. In the 
present context of metric geometry, it is perhaps most natural to consider Hausdorff 
measure of the appropriate dimension. 

Definition 14 (Hausdorff measure). The n-dimensional Hausdorff measure on a 
metric space will be denoted by 77. It is defined by 

rj(E) := Km [inf ^ diam(J7i)" , 

where the infimum is over countable covers {Ui} of E with diamt/i < S Vz. 

For the Teichmiiller metric, there is a nontrivial ^.-dimensional Hausdorff measure 
(recalling h = 6g — 6). As we shall see, in order to understand average distances 
with respect to this measure, it will be necessary to compare with other measures, 
defined below, which are also natural to consider in their own right. 

3.1. Measures on Finsler manifolds. The Teichmiiller space carries several nat- 
ural volume forms coming from its structure as a Finsler manifold. Let us discuss 
these general constructions first before returning to the case of M = T(S). The 
treatment closely follows the survey by Alvarez and Thompson [T]. 

Recall that a Finsler metric on an n-dimensional Finsler manifold M is a con- 
tinuous function F : T(M) — > K that restricts to a norm on each tangent space 
T X (M). There is a dual norm on each cotangent space T*(M). For a point x E M, 
let B x C T X {M) and B* C T*(M) denote the unit balls for these two norms. A 
local coordinate system (xi, . . . , x n ) on M induces a pair of isomorphisms 

(5) <f>: T X (M) -> R n and ip: T*(M) -> 1™ 

defined by writing vectors and covectors with respect to the dual bases {d Xl , . . . , d Xn } 
and {dxt, . . . , dx n }. By definition of the dual norm, the pairing T X (M) x T*(M) — > 
K is sent to the standard inner product on R" under these isomorphisms. In the 
local coordinate chart we may now define two functions 

\{4>(B X )) e n 

where A is Lebesgue measure and e n := A(Ball") is the Lebesgue measure of the 
standard unit ball in R n . While these functions clearly depend on the choice of 
coordinates {x\, . . . ,x n ), one may easily check that the n-forms 

f(x) dx\ A • • • A dx n and g(x) dx\ A • • • A dx n 

are independent of the coordinate system and therefore define global volume forms 
on M . The former is called the Busemann volume on the Finsler manifold and the 
latter is the Holmes-Thompson volume; see pQ for more details. These both define 
measures on M . 

A third measure to consider is the one induced by the canonical symplectic 
form uj on the cotangent bundle, defined as follows. Consider local coordinates 
(x\, . . . , x n ) defined in a neighborhood U C M. The 1-forms dx\, . . . , dx n then 
give a trivialization of T*(M) over U, and we have a local coordinate system on 
T*{M) given by 

(6) (xi,yi,...,x n ,y n ) i-> . . . ,x n ), y^Vi dxj j . 
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In these coordinates the canonical symplectic form may be written simply as uj — 
^2dxi A dyi. Taking exterior powers then yields a volume form /i sp — uj n /n\ on 
T*{M). By restricting to the unit disk bundle T*'- 1 (M) and pushing forward by 
the projection n : T*{M) — > M, we obtain a symplectic measure n on M. 

Finally, a Finsler metric on a smooth manifold M n induces a path metric d in 
the usual way, and this in turn gives rise to a Hausdorff measure in any dimension. 

Recall that a centrally symmetric convex body f2 C K ra determines a polar body 
tt° C (K™)* = R n via 

n° := {£ G E™ |e-«<lV«efl}. 

The Mahler volume of fi is then defined to be the product M(O) := A(i7)-A(il°) of 
the Lebesgue volumes of f2 and Q°. For any centrally symmetric convex body J7, 
it is known that 

(7) ^<M(fi)<4 = M(BaU"). 

The first inequality was established by John ;9j, and the latter, which gives an 
equality if and only if the norm is Euclidean, is known as the Blaschke-Santalo 
inequality [4]. 

Theorem 15 (Assembling facts on Finsler measures). Suppose that M n is a con- 
tinuous Finsler manifold. Then 

• the Busemann measure ^t B and the n-dimensional Hausdorff measure n are 
equal; 

• the Holmes -Thompson measure /i HT and the symplectic measure n are scalar 
multiples: fi HT — 7-n; 

• Mht < Mb < (n n/2 ) /i HT ,with equality of measures if and only if the metric 
is Riemannian. 

Note that it is still possible for fi HT and fi B to be scalar multiples of each other 
in the non-Riemannian case, for instance on a vector space with a Finsler norm. 

Proof. The first statement was originally shown by Busemann in the 1940s in [S] 
and is stated in modern language in [TJ Thm 3.23]. 

The second statement is straightforward and we include a proof for completeness. 
Working in the local coordinates and applying the Fubini theorem, we see that the 
Holmes-Thompson volume of a subset EcMis given by: 

g{x) dxi A • • • A dx n = [ ( [ — d\ \ dx\ A • • • A dx n 

dyi A • • • A dy n A dx\ A • • • A dx n 



For the third statement, recall that the measures are defined by 

H B (E) = / f{x)dx\/\---/\dx n and fi HT (E) = / g(x) dxi A • • • A dx n . 
J e J E 

For each x £ M, the unit ball B x C T X (M) is sent to a centrally symmetric 
convex body (j){B x ) C M™ under the isomorphism <f> defined in ((5]). The polar body 
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is exactly given by tp(B x )° = t/j(B*). Therefore, the Mahler volume of <fi(B x ) is 
M{cp{B x )) = X^(B x ))-X^(b:)) = el^. 

Combining with (JT)) now implies that n~ n / 2 f(x) < g(x) < f(x) for all x G M. 
We conclude that /i HT 

(E) < fi B (E) < n n / 2 fi HT {E) for all E C M. Finally since 
Blaschke-Santalo can only give equality for a Euclidean norm, it follows that /z B 
and /i HT can only be equal for a Riemannian metric. □ 

3.2. Measures coming from quadratic differentials. Recall that quadratic 
differential space Q(S) is naturally identified with the cotangent bundle T*(T(S)) 
of Teichmiiller space, and that each quadratic differential q € Q{S) has a norm 
given by the area of the fiat structure on S induced by q. The unit disk bundle for 
this norm will be denoted by 

Q^(S) = {qeQ(S):\\q\\<l}. 

Using this disk bundle, the natural symplectic measure /i Bp on Q(S) descends to a 
measure n on T(S) exactly as above. We note that u and therefore [i Bp and n are 
invariant under the action of the mapping class group. 

The space Q(S) also carries a natural Mod(S')-invariant measure ^ hol that is 
defined in terms of holonomy coordinates and which we will refer to as holonomy 
measure; it is also sometimes called Masur-Veech measure in the literature (see [T2J 
for details). This measure has been studied extensively, for instance to establish 
crgodicity results for the geodesic flow. The measure fi hol is also related to the 
"Thurston measure" fi TH on the space of measured foliations M.T induced by the 
piecewise-linear structure of M.T [8]. Indeed, as seen in [12] . /x hol is equal to the 
pullback of yU TH x ^ TH under the Mod(5')-invariant map Q{S) —> M.T x MF that 
sends a quadratic differential to its vertical and horizontal foliations. 

Just as fi sp induces n, the holonomy measure ^t hol descends to a measure m on 
T(S). Explicitly, the m-measure of a set E C T(S) is given by 

m{E) :=Mho. (^(E) D Q^(S)) . 

This measure m has been studied previously in [3] and [7]. 

Proposition 16. [14, p. 3746] There is a scalar k > such that /i sp = fc-/i hol . 

We recall the outlines of the argument here. In [14], it was shown that the 
Teichmiiller flow on Q(S) is a Hamiltonian flow for the function 

As such, the Teichmiiller flow preserves the symplectic form oj and the corre- 
sponding measure fi sv . The measures fi sp and /x hol both descend to the quotient 
space Q(S)/ Mod(S'); furthermore, the latter defines an ergodic measure for the 
Teichmiiller flow on Q(S)/Mod(S) [12]. Since /i Bp is absolutely continuous with 
respect to jj, boU the result follows. 

We therefore also have n = fcm, and combining Proposition [TBI with Theorem [T5l 
we get: 

Corollary 17. There are scalars > &i > such that 

k\m < rj < k2m. 



14 



DOWDALL, DUCHIN, AND MASUR 



3.3. Visual measures. The unit sphere subbundle of Q(S) will be denoted by 

Q 1 (S) = { q e Q(5): ||g|| = l}. 

For each x e T(S), the fiber Q 1 (x) is identified with the "space of directions" at x, 
and the Teichmiiller geodesic flow ip t : Q(S) — > Q(S) gives rise to a homeomorphism 

Q 1 (x)x(0,oo) -> T(S)\{x} 

which serves as "polar coordinates" centered at x. Furthermore, this conjugates ip t 
to a radial flow based at x given by 

<Pt(n(<Pr{q))) ■= n(<f r+t (qj). 

We will consider measures on T(S) that are compatible with these polar coordinates 
and with the radial flow. 

Definition 18 (Visual measure). Given any measure k x on the unit sphere Q (x) = 
S h ~ 1 , we define the corresponding visual measures on S r (x) and T(S) as follows. 
Firstly, the visual measure Vis r (/t x ) on the sphere S r (x) of radius r is just the push- 
forward of e hr K x under the homeomorphism Q 1 (x) x {r} = S r (x). Integrating these 
over (0, oo ) then gives a visual measure on T(S) defined by 

Vis(k x )(E) := f e hr dK x (q)d\(r). 

J(q,r)eEcQ 1 (S)x(0,oo) 

Said differently, Vis(K x ) is equal to the push-forward of k x x A under the home- 
omorphism \I> X , where Ao is the weighted Lebesgue measure on (0, oo) given by 
A ([a, b}) = e hr d\(r) = (e hb — e ha )/h. (We have scaled things in this way so that 
the visual measure of the ball of radius R grows like e hR .) 

The essential feature of visual measures is that they enjoy the following "nor- 
malized invariance" under the radial flow: For any t > and measurable E C S r (x) 
we have 

Vis r +t(K x )(<pt(E)) _ Vis r (K x )(E) 
Vis r+t {K x )(S r+t {x)) Vis r (K x )(S r (xj) ' 
The same invariance holds for Vis(k x ) when we normalize with respect to annular 
shells Bb(x) \ B a (x) instead of spheres. 

There are two visual measures that specifically interest us. Firstly, the normed 
vector space Q(x) carries a unique translation- invariant measure v x normalized so 
that v x {B*) — 1; recall that the unit ball B* is just the intersection Q- 1 (5') fl Q(x). 
This induces a measure (also denoted v x ) on the unit sphere Q 1 {x) via the usual 
method of coning off: v x {E) := v x ([0, 1] • E) for E C Q 1 (x). 

Secondly, since Q(S) has the structure of a fiber bundle over T(S), we can define 
a conditional measure s x on Q(x) by disintegration from fi hol . More precisely, s x is 
the unique measure on Q(x) such that the /i hol -measure of E C Q(S) is given by 

(E) = / s x (EDQ(x))dm(x). 
Jt{s) 

Via the process of coning off, we again think of s x as a measure on Q 1 {x). 

The space Q(S) of quadratic differentials is a complex vector bundle; as such, 
there is a natural circle action S 1 rx Q(S) that preserves each fiber Q(x) and 
unit sphere Q 1 (x). We say that a visual measure Vis^u,) is rotation-invariant if 
the corresponding measure n x on Q 1 (x) is invariant under this action of S 1 . The 
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visual measure Yis(v x ) is rotation-invariant because S 1 preserves the unit ball B x . 
Similarly, Vis(sa;) is rotation-invariant because S* 1 preserves /z ho i- 

3.4. Summary. The measures on T(S) considered above are n and m (induced 
by the symplectic and holonomy measures on Q(S), respectively, via the covering 
map), Hausdorff measure 77, the visual measures Vis^^) created by radially flowing 
measures on the sphere of directions Q}(x), and the measures /i B and /i HT coming 
from the Finsler structure. 

We found that n, m, and /i HT are scalar multiples of each other, Hausdorff mea- 
sure and Busemann measure coincide, and all five of these are mutually comparable 
in the sense of being bounded above and below by scalar multiples of each other. In 
the following section we will establish results about the structure of generic geodesic 
rays with respect to these measures and the visual measures. 



4. Volume asymptotics and random walks 

4.1. Volume estimates. Athreya, Bufetov, Eskin, and Mirzakhani j3] have found 
the following asymptotic estimate for the m-volume of a ball of radius r. 

Theorem 19 (Volume asymptotics [3 ). There exists a (bounded) function f : 
T(S) — ¥ (0,oo) such that for each x £ T(S) 

r— foo e 

Corollary 20 (Definite exponential growth). Let denote holonomy measure m, 
Hausdorff measure rj, or any visual measure /i x = Vis(n x ). For each x G T(S), 
there exist constants C\ < Ci such that for all sufficiently large r ( depending on x) 
we have 

de hr < (J,(B r (x)) < C 2 e hr . 

Proof. This is built into the definition of the visual measure Vis(ft x ). For the 
holonomy measure m, this follows from Theorem [19] above. The same estimate 
then holds for Hausdorff measure 77 by Corollary [17] □ 

For any r > k > 0, let A*(x) = B r {x)\B r -k{x) denote the annular shell between 
radii r and r — k. The fact that the volume of a ball grows exponentially in the 
radius means that we can focus our attention on annuli rather than on balls. 

Lemma 21 (Reduction to annuli). Fix x £ T(S) and let \i denote holonomy 
measure m, Hausdorff measure r\ 7 or any visual measure Vis(K x ). Suppose that for 
all k > we have 

1 1 f 

Then the same holds when A^(x) is replaced by B r (x). 
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Proof. Let C\ , C2 be as in Corollary [20] above. For each k sufficiently large (satis- 
fying §^e~' lfe < 1) and all sufficiently large r we have 

1 1 f 

2 > , , „ 2 / d r {y, z) d[i(y)dfi{z) 

- ik 1 \\ ik 1 w Tiki \\2 \ d T {y,z) dfi(y)dn{z) 

> ( 1 - %- hk ) - 1 / d T (y, z) d»(y)d»(z). 



The claim now follows since, by assumption, the latter becomes arbitrarily close to 
2 when r and k are sufficiently large. □ 

4.2. Random walks. In this section we follow the ideas of Eskin and Mirzakhani 
on discretizing geodesies into random paths. We begin by combining some results 
on the volume of balls from Athreya-Bufetov-Eskin-Mirzakhani 3] and Eskin- 
Mirzakhani [7]. 

Lemma 22 (Volume of balls j3[ Theorem 1.2], Lemma 3.1]). For a fixed thick 
part Te a , there exist global constants c and C such that 

m(B r (y)) < Ce hr 

whenever y € Te OR r < c. 

Proof. Choose a point x £ % . By the volume asymptotics (Theorem [T9|). there 
exist constants Rq and Co such that 

m(B r (x)) < C n e hr 

for all r > Rq. Furthermore, by increasing i?o if necessary, we may assume that 
the Mod(5')-translates of Br (x) cover T tQ . It follows that for any y £ % and any 
r > we have 

m(B r (y)) < m(B r+Ro (x')) = m(B r+Ro (x)) < C e hRo e hr 

for some Mod(S')-translate x' of x. This establishes the first claim. 

As for the second claim, Lemma 3.1 of [7] says that there exists constants c and 
C\ such that m(B c (y)) < C\ for all y £ T(S). In particular, for any r < c we have 

m(B r (y)) < m(B c (y)) <C X < C x e hr . 

This proves the Lemma with C = maxjCoe''^ , C\). □ 

To define a random walk on T(S) with basepoint x, first choose a net Af of 
points in T(S), choosing so that x £ A/", such that the points are c-separated and 
(2c)-dense (i.e., the distances between net points are at least c but the (2c)-balls 
about net points cover Teichmuller space). 

Given a parameter r, a sample path of length s (starting at x) is a map 

A: {0 7 ...,[s/t\}^N 

such that A(0) = x and for each index, dr(A(fc), A(fc + 1)) < r. Let P r (s) be the 
set of sample paths A of length s, and let P r be the set of all sample paths of any 
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length s > 0. By (36) of |7], for any 5 > and sufficiently large r (depending on 5) 
we have, 

|P T (s)| < e s ( h+5) 

for all s > 0. (Note that the constant C2 in (36) of [7], coming from [7J Proposition 
4.5], can be taken to equal i.) 

We now define a map F: T{S) — > P T which takes a point y and "discretizes" 
the geodesic [x, y] to a sample path. For any [x, y] we mark off points along the 
geodesic starting at x and spaced by time r — 2c. For each such point we choose 
a nearest point in Af. This is the sample path associated to y. For any t\ > 0, 
we can choose r sufficiently large so that the image under F of the sphere S s (x) is 
contained in P T (s(l + ei)). 

5. Separation and thickness statistics for geodesic rays 

In order to establish distance estimates in fj6l it will be necessary to restrict our 
attention to pairs of geodesic rays based at x that both spend a definite fraction 
of the their time in % and stay far apart beyond some threshold radius. In this 
section we show that most pairs of geodesies have both of these properties, with 
respect to all measures we consider. 

5.1. Thickness statistics. Our techniques for analyzing Teichmiiller geodesies 
rely heavily on controlling the fraction of time they spend in the thick part 7l - 
We call this quantity the thick-stat; for a geodesic segment 7: [0,L] — > T(S) it is 
denoted by 

Th % (7) = |{te[0,L]: 7 (t)gr to }| 

For distinct points x,y £ T(S), we will use Th^[x,y] to denote Thj^f/y), where 
7: [0, dj-(x, y)] — > T(S) is the Teichmiiller geodesic from x to y. The goal of this 
subsection is to show that the thick-stat Th^ [x, y] is uniformly bounded below for 
most y £ A!j:(x) for the measures of interest. 

Definition 23. We say a measure [i on T(S) satisfies a thickness estimate if there 
exist constants S, eo > such that for every x £ T(S), e > 0, and < a < 1, we 
have 

fx ({y £ A*(x) : Th%[x,j(t)] < S for some t £ [ar,r]}^j < e 
for sufficiently large r. 

First we observe that such an estimate for visual measures is essentially imme- 
diate from the ergodicity of the Teichmiiller geodesic flow ip t . Fix a measure k x 
(either s x or v x ) on Q 1 (x) and an eo-thick part T eo - 

Proposition 24 (Thickness statistics for visual measures). There are constants 
<5, e > such that for all e > and all x £ T(S), there exist a threshold Rq and 
a set E C Q 1 {x) with k x (E c ) < e so that for all q £ E and r > Rq we have 
Th*[s,7rfov(?))]>«. 

Proof. By the ergodicity of the geodesic flow [TJ] there is S > such that the 
geodesic determined by almost every q £ Q(S) spends proportion S of its time in % , 
asymptotically. The vertical foliation of each such q is uniquely ergodic [13] . If two 
quadratic differentials have the same vertical uniquely ergodic measured foliation 
then they are forwards asymptotic pj]. We conclude that almost every measured 
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foliation F G M.T (with respect to Thurston measure /] TH ) has the property that 
for any quadratic differential with vertical foliation F, the corresponding geodesic 
spends at least d proportion of its time in the thick part, asymptotically. 

The map Q(x) — > M.J- which assigns to q its vertical foliation is a smooth map 
off the multiple zero locus, so it is smooth on a set of full measure. Thus it is 
absolutely continuous with respect to the measures k x and fJ. Ta . Thus the property 
of asymptotically spending proportion S in the thick part is true of almost every 
q G Q(x)- Thus for every e, long enough geodesies are thick for proportion S of 
their length away from an exceptional set of k x -measure at most e. □ 

We next verify a thickness estimate for m. 

Theorem 25 (Thickness statistics for holonomy measure). For all < 9 < I, there 
exists e > such that for any < a < 1 , there exists n > such that for all k and 
all sufficiently large r, 

m ({y G A k r {x) : Th*[x,7(i)] < for some t G [ar,r]}) < Ce^ 1 "^, 

where 7 is the geodesic based at x and passing through y. 

Proof. We start with a geodesic 7 connecting x to y 6 A%(x). For any ei, this is 
associated to a sample path A in P T (r(l + ei)), as in the last section. We quote the 
proof of Theorem 5.1 of [7] (which itself quotes [2]) to say that given 9, there exists 
5' > and eo such that for all large r, with the exception of at most 

e _5't e fcr(l+ei) 

sample paths A, we have the estimate 

T^j|{l<i< \t/r\ :A(i)eT eo }|>0. 
Now given a, choose ei small enough (forcing r to be large) so that 

5/(7 

K := — ei > 0. 

h 

We conclude that for t > ar with the exception of at most 

e -S't e hr(l+c 1 ) < e hr{l-S'<T/h+^) < gfer(l- K ) 

net points, the geodesies starting at x and ending at the net points have Th" > 9 
for ending times t > ar. Now since we know that the ball of radius c has m measure 
at most C, we are done, because all bad endpoints of geodesies y G A^(x) are within 
c of bad endpoints of sample paths. □ 

This says that, except for set of endpoints y of exponentially small measure, 
geodesies have the property that they eventually have spent a definite fraction of 
their time in the thick part. This exponential bound makes the statement stronger 
than what is required for a thickness estimate. 

5.2. Fellow-traveling statistics. We need an estimate for how likely it is for two 
geodesies to fellow-travel past some radius Rq. The appropriate sort of estimate 
will hold provided that the measure satisfies the following property: 
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Definition 26. We say a measure [i on T(S) satisfies an exponential decay estimate 
if given any a, Mo > there exist C,Ro,a > such that for all t, r with Rq < ar < 
t < r we have 

fi x v{{(yi,y 2 ) € A k r (x) x A k r (x) : rf r (7i(*), 72(f)) < M }) r _ at 
M-4K*)) 2 ' 
where 7^ is the geodesic ray based at x and passing through yi. 

Theorem 27 (Exponential decay for visual measures). All rotation-invariant visual 
measures fi x = Vis(/t x ) on T(S), and in particular Vis(f x ) and Vis(s x ), satisfy 
exponential decay estimates. 

Proof. Choose ar < t < r, fix a point y\ € A^(x), and let E — {y 2 € A k {x) : 
^r(7i(*))72(t)) < M }. Looking instead in the sphere S t (x), we have the set 
E' = {ze S t (x) : d r (7i(t),z) < M }. Notice that, by definition, 

E= (J Vs(E'). 

[r — t — k,r — t] 

(Recall that Lp s denotes the radial geodesic flow based at x.) Therefore, by the 
normalized invariance, we have 



H(E)= f Vis s (K x )(vs-t(E'))d\(s) 

J r — k 



-I 

J r 



Vis t ( Kx )(E — dA(s) 
-A, Vis t (K x )(5 t (a;)) 



Ks(Q 1 (a;)) 

where, in the last line, we have identified E' with its image in Q (x) = 5j(x). 

It remains to find C, i?o (independent of y\) such that k x {E') / k x {Q} (x)) < Ce~* 
when t > Rq. Recall that S 1 acts freely on Q 1 {x) by rotations. Choosing orbit 
representatives, we may realize Q}(x) as a setwise product (Q 1 (x)/S 1 ) x S* 1 . The 
measure k x pushes forward to a measure on Q 1 (x)/S 1 . By disintegration, we then 
obtain a measure on each fiber S 1 which, by the rotation- invariance of k x , must 
agree with Lebesgue measure up to a scalar. For any two points z,z' € E', the 
triangle inequality gives df(z,z') < 2M . Now suppose that z and z' lie in the 
same Teichmiiller disk, meaning that the unit quadratic differentials associated to 
the geodesies [x, z] and [x, z'\ lie in the same S' 1 -orbit. Each Teichmiiller disk is 
an isometrically embedded copy of the hyperbolic plane. Thus, when t is large 
compared to Mo, hyperbolic geometry implies that the fraction of each S^-orbit 
contained in E 1 is at most Ce~* for some constant C. Using the product struc- 
ture and integrating over the Q 1 (x)/S 1 factor, Fubini's theorem then implies that 
k x (E')/k x (Q}(x)) < Ce~* as well. □ 

Theorem 28 (Exponential decay for holonomy measure) . The measure m satisfies 
an exponential decay estimate. 

Proof. For a given r, t, let 

D r ,t = {(2/1,2/2) G A k r {x) x A k r {x) : d T {li{t), l2 {t)) < M } 
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denote the the set in question. Set 8 = 1 /2 and choose eo and K > as in Theoreml25l 
so that the m-measure of the set 

E r = {yi E A k r (x) : Th^ar.TiC*)] < 9 for some t £ [ar,r]} 

is at most Ce' lr( - 1 ~ K - 1 for all large r. We may now write D r t = D' r t U D" t , where 

D' r ,t = {(»i,Ift) e D P ,t : Vi e -Er} and = {(yi,y 2 ) 6 A-,t : Z/i £ #r}- 
By the above, we have that 

m x m(DL t) , M s , , . 

(8) . Al \ !' tJ < Ce hr(1 - K ^ < Ce hr e- hKt 

for all large r and alH < r. 

Choose any point y\ € A k (x) \ E r and fix some t € [or, r] . Then any point 
U2 G satisfying d-r(-fi(t),j 2 (t)) < M must be contained in the ball of radius 

r — t + Mq about 71 (t). Since t > err and yi $ E r ; we know that Th* [x, 71 (f)] > 9. 
Choosing any < a < 9, there must exist a time t' € [cci, 4] for which 71 (t') € 7^ . 
Furthermore, each such j/2 lies within the ball of radius (1 — a)t + Mq + r — t about 
7i(f'). Applying Lemma l22| we see that 

m x m(D'' t ) , ,,, , , , 

(9) 77^ ^ m B r _ at+Mo 71 *' < Ce'^'e^e-""'. 

m(„4~(a;)) 

The claim now follows from ([5]), ^ and the fact that, by Corollary [301 there exists 
a constant C\ such that m(A k (x)) > C\e hr for all large r. □ 

Finally, we see that for any measure satisfying an exponential decay estimate, 
most pairs of geodesic rays are in fact never near each other beyond some threshold. 

Proposition 29 (Separation is forever). Suppose a measure fi satisfies an exponen- 
tial decay estimate. Then for any < a < 1 and Mq > there exist RQ,C,a > 
such that for all r > Rq the (i x /i-measure of the set 

{(3/1,3/2) € A k {x) x A k ;{x) : dr(7i(*),72(*)) < M /or some i € [or,r]} 

is bounded by n(A k {x)) 2 Ce~ aar , where here % denotes the geodesic ray based at x 
and passing through yi . 

Proof. If 2/2) is such a point, then there is some n £ N, n < (1 - a)r such that 
^T(7i(°" r + ri );72(or + n)) < Mq + 2. Thus our set of points is contained in the 
union of exceptional sets corresponding to the radii err, or + 1, . . . , err + L r — °" r J ■ 
Using the exponential decay estimate, we see that our set has measure at most 

^{A k r {x)fC (e- aar + e - Q ( CT ' r+1 ) + • • • + e -a(<n-+[r-»rA 

Thus we can conclude that after throwing out a subset of A k (x) x A k (x) of mea- 
sure which is an arbitrarily small proportion, all pairs of geodesies stay separated 
by an arbitrarily chosen distance in Teichmuller space after a threshold time or 
has elapsed. Later we will show that after waiting even longer we may also assume 
that every pair of geodesies has big curve complex distance. This relies both on 
the large Teichmuller distance established above and the fact that most geodesies 
spend a definite fraction of their time in the thick part. 
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6. Distance estimates 

6.1. Progress in the curve complex. The goal of this subsection is to prove 
Proposition [351 which says that any geodesic that spends a definite fraction of its 
time in the thick part must move at a definite rate in the curve complex. The idea is 
that long subintervals contained in T eo contribute to progress in C(S); alternately, 
one could consider intervals in the complement of IJyTy. For this analysis, we 
would like to bound the number of connected components of IJy Ty in terms of 
ds{x,y). One bound is given by the number of nonempty thin intervals. While 
there may be arbitrarily many such Ty, some of these will be redundant in the 
sense that Ty C T^y for some other subsurface W . 

Definition 30 (Thin-significance). Recall the choice of global constant M. A 
proper subsurface V C S is said to be thin- significant for the geodesic segment [x, y] 
if dc(v){ x i y) > 3M and for every other proper subsurface Z C S with dc(z)( x , y) > 
3M we have Ty £ T z . 

Remark. In this subsection we will focus on the curve complex distance dc(y) f° r a 
subsurface V . Recall that this agrees with the usual projection distance dy in the 
case that V is non-annular, but that c?c(A) an d ^A differ for annuli. We will take 
care to handle exceptional annuli carefully. 

Our next goal is to bound the number of thin-significant subsurfaces along an 
arbitrary geodesic. For this, we will use the work of Rafi-Schleimer [20] bounding 
the size of an antichain in the poset of subsurfaces of S. 

Definition 31 (Antichain). Given a subsurface ScSa pair of points x, y € T(S) 
and constants T\ > Tq > 0, a collection D, of proper subsurfaces of S is an antichain 
for (E,x,y,T ,Ti) if the following hold: 

• if Y, Y' € £1, then Y is not a proper subsurface of Y'\ 

• if Y £ Q, then dc(y)(x,y) > To; and 

• if Z C S and d C (z){x,y) > Ty then Z cY for some Y e fi. 

Lemma 32 (Antichain bound [50] Lem 7.1]). For every S C S and sufficiently 
large T\ > To > 0, there is a constant A — A(E,Tq,Ti) so that iffl is an antichain 
for (£, x, y,T ,Ti) then 

< A-d cm (x,y). 

We now prove a proposition showing that if there are a large enough number of 
thin-significant subsurfaces along a geodesic, then the image of the geodesic makes 
definite progress in the curve complex. The following notation will be used in the 
proof. 

Definition 33. Consider a geodesic segment [x,y] C T(S) and a collection £1 of 
proper subsurfaces of 5*. We will consider three partial orders on the set f2: 

(1) V <i W ^ V c w, 

(2) V < 2 W <=> Ty C T w , and 

(3) V < 3 W ^ V c W and Ty c T w . 

The subcollection of f2 consisting of maximal elements with respect to <* will be 
denoted (fi)*; notice that these sets are related by (O)i C (fi)3 D (^)2- Elements 
of (f2)i are said to be topologically maximal with respect to f2. 
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Proposition 34 (Progress from thin-significant subsurfaces). For any to, there 
is a constant N such that if dc(s)( x ,y) is to, then the number of thin- significant 
subsurfaces Y along [x,y\ is at most N. 

Proof. Let fl = {V C S : dc(v){ x ,y) > 3M} be the collection of proper subsurfaces 
which have a large projection. By definition, the set of thin-significant subsurfaces 
is exactly given by (£1)2- On the other hand, the subcollection (fl)i of topologically 
maximal subsurfaces clearly forms an antichain for (S, x, y, 3M, 3M). By Lemma I3"2l 
we therefore have < Atg for some constant A. We will extend this to a bound 

on the cardinality of the larger set (H) 3 ; this will imply the proposition because 
(fi) 2 c(fi) 3 . 

Fix a proper subsurface W £ fl and consider the set Uw = {V £ (fl) 3 : V C W}. 
We claim that \Uw\ is bounded by a constant depending only on the complexity 
of W . By the above, this will suffice because each V £ (0)3 is either equal to or 
properly contained in some topologically maximal proper subsurface W £ (O)i. 

First consider those V £ Uw for which Ty fl Tw 7^ 0- The definition of <3 
implies that Ty <£_ Tw\ therefore Ty must overlap with at least one endpoint of 
T^y- If Tvi and Ty 2 both contain the initial endpoint of T^y, then Ty HTy, 7^ 
and so we cannot have V\ itl V2. Since there is a universal bound on the number 
of subsurfaces such that no two intersect transversely, this bounds the number of 

V £U W for which Ty n T w ^ 0. 

It remains to bound the number of V £ Uw for which Ty n T^y = 0; we will only 
focus on the case Ty < T^y. Suppose that Tw — [a,b] C [x,y] and consider the 
set ft' — {V £ : V C W and d C (y)(x,a) > 2M}. Notice that the subcollection 
(fi')i forms an antichain for (W, x, a, 2M, 4M): the only difficulty is to check that 
every Y C W with dc(Y)(x, a) > 4M is contained in an element of (fl')i- However, 
this is true because the reverse triangle inequality guarantees that d C (Y)(x,y) > 
d C (Y){x 1 a) — B > 3M and therefore that Y £ fl'. Since d C ( W )( x , a ) < M, Lemma I3"2l 
now gives a bound on |(f2')i|. Finally, notice that for each V £ Uw with Ty < Tw 
the triangle inequality gives dc(v) i x , a ) ^ dciv) ( x ,y)~M > 2M and so ensures that 

V € fl'. Therefore each such V is contained in some topologically maximal Z £ fl'; 
that is to say, each V £ Uw with Ty < Tiy is contained in Uz for some Z £ (fl')i- 
The bound on \Uw \ now follows by induction on the complexity of the subsurface 
W. □ 



Definition 35. Define a constant P := 36 max |log + (l/eo), log + (3M)}. Say that an 
annular subsurface A has an exceptional thin interval Ta along [x, y] if dc(A)i x , y) is 
3M but dA{x,y) > P. By Lemma [9l this is only possible if l x (dA) < e or l y (dA) < 
eq. Therefore, such an annulus must determine a nonempty thin interval along [x, y] 
that contains either x or y. Since all annuli A with l x (dA) < eo must be disjoint, 
we see that there is a universal bound (namely 617 — 6 = h) on the number of annuli 
with exceptional thin intervals along an arbitrary geodesic [x, y]. 

We now define the primary thin portion W of a geodesic segment [x, y] to be the 
union of thin intervals Ty for all non-annular proper subsurfaces with dv{x,y) > 
3M and all annular subsurfaces with dc(A)( x ,y) > 3M or dA( x ,y) > P. In the case 
that ds{x,y) < to, Proposition l34l implies that W is the union of at most N + h 
thin intervals, namely, those corresponding to the thin-significant subsurfaces and 
to annuli with exceptional thin intervals. 
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While W does contain most of the nonempty thin intervals along the geodesic, 
it need not cover the entire time that [x, y) spends in the thin part of Teichmiillcr 
space. Nevertheless, the projections to all proper subsurfaces remain uniformly 
bounded on the complement of W. 

Lemma 36 (Complement of W). There exists a constant M' with the following 
property. If [a, b] C [x, y] \ W is a connected interval in the complement of the 
primary thin portion of [x, y], then dy(a, b) < M' for all proper subsurfaces Y C S . 

Proof. First suppose that Y satisfies the reverse triangle inequality (|4|) along [x, y]. 
If Y is non-annular and dy(x,y) > 3M, or if Y is an annulus and d C (y^(x,y) > 
3M or d Y (x,y) > P, then Ty C W by definition. Therefore d Y (a,b) < M since 
[a, b] n Ty = 0- If this is not the case, then the reverse triangle inequality gives 
d Y (a, b) < d Y (x, y) + B < 4M + P as claimed. 

It remains to consider an annular subsurface A c S for which the reverse trian- 
gle inequality fails. We may assume that dc(A)(x,y) < 3M and dA(x,y) < P, for 
otherwise we have C W and oU(a, b) < M as above. Let B' be the constant 
corresponding to the threshold 5M + P in Theorcm[l3l (R.T.I, exception). According 
to that theorem, applied to the geodesic [a, y], we either have ^(a, b) + ^(6, y) < 
d^(a, y) + B' , or there exist subsurfaces Wi that satisfy the reverse triangle inequal- 
ity and which have dwi{a,b) > 5M + P. However, as we have seen above, there 
are no such proper subsurfaces. Therefore the former inequality must hold. We 
similarly have d(x, a) + d(a, b) < d{x 1 b) + B' . Adding these inequalities and using 
the triangle inequality then gives d,A(a, b) < dA{x, y) + B' < P + B' . □ 

By the distance formula (fT]), it follows that long intervals in the complement of 
W must travel a large distance the curve complex C(S) of the whole surface. The 
following lemma says that each such subinterval contributes to the curve complex 
distance along the total geodesic. 

Lemma 37 (Cumulative contribution of subintervals). There exist constants < 
pi < 1 and D\ > such that for all d > D±, if [x,y] is a Teichmuller geodesic 
that contains n subintervals [xi , yi] with disjoint interiors whose endpoints satisfy 
ds{xi,Vi) > d, then 

ds{x,y) > p\nd. 

Proof. Applying the reverse triangle inequality (j4|) to the points xt and yi we have 
d s {x,Xi) + d s {x i ,y l ) + d s {y l7 y) < d s {x,y) + 2B. By recursively applying this 
observation to [x, Xi] and [yi, y] and then throwing out the complementary intervals, 
we find have that 

ds(x, y) > d s (xi,yi) - 2nB >nd- 2nB. 

Choose D\ > AB and p\ — 1/2. Then for d> D\ the quantity on the right side is 
at least p\nd. □ 

We now fix once and for all a "definite progress" constant D > sufficiently large 
so that piD > D\ (and thus D > D\ as well). Applying the distance formula (fT]) 
with the threshold M' given by Lemma [3^1 we have quasi-isometry constants K, C 
such that d-r(a,b) < Kds(a.b) + C for any connected interval [a, b] C [x,y] \ W. 
This gives rise to a fixed value L such that any interval [a, b] of length at least L that 
lies entirely in [x, y] \ W satisfies ds(a, b) > D; for example, any value L > KD + C 
will suffice. Thus according to Lemma |37l if / is any interval along a geodesic that 
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contains a subinterval of length L that is disjoint from VV, then the distance in the 
curve complex between the endpoints of / is at least piD. 

Furthermore by Proposition [M] associated to the constant to = piD, there is a 
constant N so that the conclusion of Proposition [34l holds. 

Proposition 38 (Definite progress). For each < S < 1, there exist constants 
p, Ri > with the following property. If [x, y] is a Teichmiiller geodesic of length 
r > Ri such that Th e ° [x, y] > 8, then ds(x,y) > pr. 

Proof. Let N denote the constant obtained by applying Proposition [34] with to = 
piD. Choose n so that nS > 1 and make the following definitions: 

, nS-l L(N + fe + l) _ pfD 

d = r> T o > T, > -Ri = 2T , p = 



n- 1 ' " " S> ' ' r 2nT 

Let [a;, y] be a Teichmiiller geodesic of length r > R\ that spends at least Sr in 
the thick part. Set m = [r/T \ and divide [x,y] into m subsegments of length 
r/m > Tq. Let us say that a subsegment [a, b] C [x, y] is stalled if ds(a, b) < piD and 
progressing if ds(a,b) > piD. Suppose that mi of the subsegments are stalled, and 
thus m2 = m — mi are progressing. Given a stalled segment [a, 6], we decompose it 
into its primary thin portion VV and the corresponding complementary subintcrvals. 
Since the interval is stalled, Proposition [34] ensures that W is the union at most 
N + h thin intervals. Therefore we can conclude that W has at most N + /i + 1 
complementary subintervals in [a, b\. Furthermore, each complementary subinterval 
has length at most L, for otherwise we would have dg(a, b) > piD by the preceding 
paragraph. Since W is contained in the thin part, we see that the total amount of 
time that this interval [a, b] spends in the thick part is at most 

(N + h+ 1)L < 6' T < S'r/m. 

Therefore the total amount of time that the full interval [x, y) spends in the thick 
part is at most 

(V — J mi + ( — J m 2 = —(S'mi + m 2 ). 
V mJ Vm/ m 

We claim that m 2 > m/n. If this were not the case, then we necessarily have 

mi > (n — l)m/n. Since 6' < 1, it follows that 

o -mi + l-m 2 < o ■m h 1-m— 

n n 

where the inequality is valid by the elementary fact that for any constants a, b, c, d, a, ft 
such that a + b = c + d and < a < ft we have 

(10) a-a + ft-b < a-c + ft-d <S=> a > c. 

But then the amount of time that [x, y] is thick is less than 

1 1\ (nS- 1 n- 1 1 , 
— 6 m h m— = r — • 1 = ro, 



m \ n n 



which contradicts the assumption on [x, y]. Therefore m 2 > m/n, as claimed. 

On each of the m 2 progressing intervals, the curve complex distance between end- 
points is at least p\D. Therefore, cumulative contribution of subintervals (Lemmal37p 
implies that 

ds(x,y)>Pim 2 ( Pl D)>pjD->^-(^~l] >p^ r r = pr. □ 

n n Wo / 2n±o 



STATISTICAL HYPERBOLICITY IN TEICHMULLER SPACE 



25 



6.2. Separation in the curve complex. We next establish a technical lemma 
which says that if we have a pair of geodesic segments from a common basepoint, we 
can "back up" from the endpoints to earlier points that admit distance estimates 
for both dj- and d$. 

Lemma 39 (Backing up to eliminate thin parts). There exist constants ko and Mq 
with the following property. Suppose 71 , 72 are a pair of geodesic rays based at x and 
let y — 71 (ti) and z — 72(^2)- Then there are times < i'j < t% and corresponding 
points y' — 71 {t\), z' — 72(^2) suc h that 

(i) d s (y,y') < k -d s (y,z) and d s (z, z') < k -d s (y, z); 

(ii) either ds(y' , z') < 6 or dy{y' , z') < Mq for all proper subsurfaces Y . 

We will call these the backup points and backup times for the segments [x, y], [x, z]. 
Furthermore, 

(hi) given any d, there are No and Co such that if df(y, z) > Nq and ds(y, z) < 
d, then the backup points satisfy 

driViV') > c -d T (y,z) or d r (z, z') > c -d r (y, z) 

Conclusion (i) says that the backup points are not much farther from the end- 
points in the curve complex than the endpoints are from each other. The interpre- 
tation of (ii) is that in the distance formula ([1]), a significant contribution is made 
either by the whole curve complex distance or by projection distances to proper 
subsurface (but not both) . These properties of backup points hold in general; (iii) 
says that if the Teichmullcr distance between endpoints is long enough relative to 
their curve-complex distance, then on at least one side the distance backed up was 
significant. (Compare (i) and (iii).) 




Figure 2. Two rays. Some thin intervals are shown, and two thin 
intervals for the same subsurface V\ C Q are marked. 

Proof. First, we will construct backup points satisfying (i) and (ii). Then we will 
verify (iii). 
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Let 

Q = {V C S : d v {y,z) > 10M}. 

If = 0, let y' = y and z' = z and we are done, since (i) is trivially satisfied and 
(ii) works with Mo = 10M. So assume f£ is nonempty. 

For each V £ ft, there is a thin interval Ty along [y, z}. By the triangle inequality, 
for each V £ f2, either dv(x, y) or dy (x, z) is at least 5M and so there is a nonempty 
thin interval Ty along at least one of [x,y] or [x, z], as in Figure [21 

Now let y" and z" be the earliest endpoints on ji of any Ty for 7 6 0. That 
is, we move back in time until we have passed through each thin interval Ty for 
V £ fi. Let us use the notation V\ and V2 for the last surfaces in SI whose thin 
interval one passes through in going from y to y" and z to z", respectively. Thus 
we have a short curve at y" which is also short somewhere along [y, z], which means 
that the image of y" under the projection T(S) — > C(S), which sends a surface to 
its Bers marking, shares a point with the image of [y, z] in the curve complex. By 
the reverse triangle inequality, this means ds(y,y") is bounded above relative to 
ds(y,z), as in (i). 

Now with respect to the new points y" and z", let 

£1" = {Y : d Y {y", z") > 100M + 2B'}, 

where B' is the constant obtained by applying TheoremfTSI (R.T.I, exception) with 
the threshold 50M. Note that f2nf2" = because no V £ has a thin interval along 
[x, y"] or [x, z"], so we would get a violation of the triangle inequality if V were also 
in SI". If VL" = 0, then by setting y' = y", z' = z" , and letting M = 100M + 2B' , 
we are again done. So assume fl" ^ 0. 

Fix some Y £ f2". We know that either dy(x, y") or dy(x, z") is at least 50M+B'; 
let us suppose that the first of these is true. First assume that Y satisfies the reverse 
triangle inequality (Lemma I12j) . Then 

dy(x, y) > 50M + B' - B > 49M. 

Since dy(y,z) < 10M, the triangle inequality then implies 

d Y {x,z) > dy{x,y) - 10M > 39M. 

That is, for any Y £ fi" that satisfies the reverse triangle inequality we have 
dy(x,y),d Y (x,z) > 39M. 

If Y is an exception to the reverse triangle inequality, it must be an annulus, say 
with core curve a. Then by Theorem[T3lwe either have dy(x, y) > d y (x, y") — B', or 
there exists W disjoint from a such that W satisfies the reverse triangle inequality 
along [x,y] and dw{x,y") > 50M. In the former case we have dy(x,y) > 50M 
so that dy(x, z) > 40M by the triangle inequality. In the latter case, we have 
dw{x,y) > dw(x,y") — B > 49M by ([4]). Furthermore, since W must have a thin 
interval along [x, y"], it cannot be in f2 and so we conclude dw(x, z) > 39 as above. 

In summary, to each Y £ 0" we have associated a domain W ^ f2 (equal to Y 
except in the one case) such that both dw(x, y) > 39M and dw(x, z) > 39M. Notice 
that we need only resort to the exceptional case W 7^ Y when dy(y", y) > M (still 
assuming here that dy(x,y") > 50 M + B'), for otherwise we may again conclude 
dy(x,z) > 39 by the triangle inequality. The inequalities dy(x,y"),dy(y" ,y) > M 
require that Y is thin along both [x, y"} and [y", y}; by the connectedness of Ty, this 
implies that a — dY is thin at y" and consequently disjoint from dv\. Therefore, 
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since W is disjoint from a, we may safely assume that ds{dW, dV\) < 2 in the 
exceptional case W ^ Y. 

We now make the following observation: 



and likewise for z,V2,72- We argue by contradiction. Otherwise Ty 1 < T Z and so 
we have d z (x,y") < M which implies d z (y",y) > 38M. Since d z (y,z) < 10M and 
dv 1 (y,z) > 10M this would violate Lemma [TT] viewed from y. Namely along [y,y"] 
we pass through T z and then Ty x while along [y, z] we pass through T z with a 
much smaller projection. 

For each Y G SI", we now see that the associated subsurface W £ SI satisfies 
ds(dW, dVi) < 2 for either i = 1 or i = 2. We have already observed this when 
W Y, and in the case that W = Y we in fact have that Y is disjoint or nested 
with respect to either V\ or V2. Indeed, if Y (tl Vj for both i = 1,2, then (fTTj) would 
imply that Ty C [x,y"\ and TV C [x,z"]. But then d Y {y" ,y),d Y {z" , z) < M and 
since dy(y, z) < 10M we have contradicted dy{y" , z") > 50M. 

We are ready to define the back up points y' and z' . Choose any Y £ SI" (recall 
that we are assuming SI" ^ 0) and let W be the associated subsurface. If W is 
disjoint or nested with respect to V\ then define y' = y" . Otherwise (jTTJ) implies 
that T\y < Tvi along 71; in this case we back up farther and define y' to be the 
beginning of Tyj/ along [x, j/]. Define the point z' similarly. 

Since either V\ or W is thin at y' and ds (dW, Z) < 2 for some surface that 
is thin along [y,z] (namely, V± or V2), we conclude that ds(y,y') (and similarly 
ds(z,z')) is bounded relative to ds(y,z), as in (i). If we have backed up farther 
on both sides, then W is thin at both y 1 and z' so that ds(y' , z 1 ) < 4 (recall that 
a Bers marking has diameter 2 in C(S)). If we backed up on just one side, then 
we have a path of length 1 in the curve complex (either dW — dV2 or dV\ — dW) 
showing that ds(y' , z') < 5. Finally, if we didn't back up on either side, then the 
path dV\ — dW — dVi of length 2 in the curve complex implies ds{y' , z') < 6. This 
verifies (ii). 

We now consider (iii), so we are assuming that ds{y,z) < d. The distance 
formula (p} says 



where K, C are the constants coming from threshold Mq — 10M. Let us write 
di2 for d-j-(y,z). Take Nq > 2K d + 2C, so that the assumption of (iii) says that 
dig > 2Kd + 2C. This gives 



For each V € SI, the definition of y' and z' ensures that Ty n[x, y'} — Ty P\[x, z'\ — 
0; therefore d v (y',z') < 2M. Therefore 

dv(y, z) < dv(y,y') + dy{z, z') + 2M. 
Furthermore, since dv(y,z) > 10M, it follows that 



(11) 




d r (y,z) < K d s {y,z)+Y^dv{y,z) + C < Kd+ C + Kj2dv(y,z) 
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Notice that it cannot be the case that dv(y,y') < 3M and dv(z,z r ) < 3M, for 
this would imply that d v (y,z) < 3M + 3M + 2M < 10M, which is not the case. 
Therefore at least one of dy(y, y 1 ) or dy (z, z') is larger than 3M. We now have that 

i d v(y, y')] 3 M + i d v(z, z')] 3M >d v (y, y') + d v (z, z') - 3M 

>^d v (y, z) - 3M > Q - d v (y, z) = ^d v (y, z). 

Let K',C be the constants in ([1) for the threshold Mo = 3M and enlarge No if 
necessary to ensure that d\2 > WC'K'K. The distance formula then gives 

dr(y,y') + d T (z,z') >i ^d s (y,y') + d s (z,z') + ^ [dv(y,y')] 3M + [dv(z,z%^J 
>—Y -d v {y, z) - 2C > - 2C 



> 



Rl ^ 2 / - 4K , R 

d\2 di2 di2 



AK'K 8K'K 8K'K' 
so we are done if we take cq = 16 ^, K ■ O 

Next we show that a pair of long geodesic segments which both stay far apart 
from each other in T(S) and spend a large fraction of their time in 7l must have 
big curve complex distance at some point. We will repeatedly use Lemma l39l to 
"back up" along each of the two rays in order to make distance estimates. 

Theorem 40 (Curve complex distance estimate). Fix < S < 1. There is a 
constant A > 1 such that for each d > 6 there exist constants Dq,Rq > with 
the following property. Given any T > R$, let 71 = [x,y] and 72 = [x,z] be two 
Teichmiiller geodesies based at x with lengths T\,r% > XT. Suppose that 

(1) Th^[x, y) and Th^ [x, z] are both at least 5; and 

(2) for all t > T, the point 71 (t) is not contained in the Do-neighborhood of 
the geodesic [x,z], and similarly for 72 (i) and [x,y]. 

Then ds{y, z) > d. 

Proof. Choose A such that AS > 1 and define 8' = ^Er i notice that < 8' < 5. Let 
p, Ri and p',R[ be the corresponding constants guaranteed by Definite Progress 
( Proposition [38]) . Set fco, Mq, No, and Co to be the constants from the backing up 
lemma (Lemma I3T))) for our given d, and let K > 1 and C > be large enough to 
be constants in the distance formula (dJ for the threshold M . Let D and Ro be 
any constants which satisfy 

A)>max|iVo, ~~~~~~T) — , K(2k + l)d + c\ , and 
I cop' c J 

Ro > max { i?i , — 



Let T > Ro, [x,y] and [x,z] be as in the statement of the theorem; in order to 
derive a contradiction, assume furthermore that ds{y,z) < d. 

Set yo — y an d zq — z. By repeatedly backing up along 71 and 72, we will 
recursively define sequences of points {yi}, {zi}. Each step of the sequence will cover 
a large Teichmiiller distance and a comparatively small curve complex distance. 
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After backing up sufficiently far, we will eventually contradict the fact that each 
geodesic 7; spends a large fraction of its time in % . 

Suppose that the points have been defined and satisfy 



(Notice that these conditions hold for the initial points yo and zq.) Backing up 
along the rays as in Lemma 1391 we then obtain new points Vi+\ G and 
Zi + i G [x, Zi]. For these, we have that ds{yi+i,Vi) and ds{%i+\, Zi) are bounded 
above by ko ■ ds{yi, Zi) < k^d, by property (i) of backup points. We also have that 
either [j/i+i , yi] or [zi + i,Zi] (or both) has length at least cq ■ d-r{yi,Zi) > cqDq, by 
(iii). We are free to continue defining new points in this manner as long as the 
conditions of Q remain satisfied. 

Suppose then that we have applied this procedure m times and arrived at points 
y m and z m . At each step of this process we traveled back a Teichmuller dis- 
tance of at least coDq along one of the two segments 71 or 72. Therefore on 
at least one of the geodesies we have traveled a total Teichmuller distance of at 
least coDom/2. Without loss of generality, suppose 71 has this property; then 
dT(ym,yo) > CoDom/2 > R[. On the other hand we have ds(y m ,yo) < kodm, 
since at each step we travel at most kgd in the curve complex. Therefore, along 
the geodesic segment [y mi yo] the ratio of curve-complex distance to Teichmuller 
distance is 

ds{y m ,yo) < 2k dm , 
dr(Vm,yo) ~ c D Q m 

Proposition l38l now implies that Th^[y m ,yo] < 

Let t m G [0,ri] be the time for which 7i(£ m ) = y m - We claim that if t m > ri/A, 
then the points y m , z m satisfy Q so that we may reapply Lemma [35] and back 
up farther to points y rn +i and z m+ i. Firstly, since t m > r%/\ > T, hypothesis 
(ii) in the theorem implies that y m = 71 (t m ) is not within Dq of any point on 72; 
whence d-]-(y m , z m ) > Dq. Applying the triangle inequality to the series of points 
implies that ds(y m , z m ) < (2k + l)d. Now if d Y (y m -z m ) < M 
for all proper subsurfaces Y, then the distance formula ([1]) gives 

d T (y m ,z m ) < K -c? s (y m ,z m ) + C < K(2k + l)d + C < D , 

which is a contradiction. Thus by Lemma [39] we have ds(y m , z m ) < 6 < d. Finally, 
since the fraction of [t m , r{\ that 71 spends in Te is at most 6' < 6, it must be that 
the thick-stat along [0,t m ] is at least S (since the thick-stat of the whole of 71 is 
S). By Proposition 1381 since t m > T > Ri, it now follows that ds(y m ,x) > pt rn > 
pR$ > 2d. As ds{y m , z m ) < d, we see that z m 7^ x. Thus we have verified the 
conditions of @. 

We have thus verified that we can repeatedly back up until we reach a point 
y m = 7i(i) on [x,y] (or similarly a point on [x, z]) such that t < 77/A and the 
fraction of [t, rjj that spends in T ta is strictly less than S' . It now follows that 
the amount of time that 71 spends in Te along [0,ri] is strictly less than 



again by shifting weight as in ([T0|) . This contradicts the first hypothesis of this 



(*) 



ds{yi,Zi) < d; 



dr(yi,Zi) > -Do; 



y%^ Zi ^ x . 




theorem. 



□ 
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6.3. Teichmiiller distance. The previous theorem implies that, under suitable 
conditions, we may assume our two Teichmiiller geodesies stay far apart in the 
curve complex beyond some radius AT. We now show that in this situation, the 
distance between two points on the sphere of radius r ^> AT is on the order of 2r. 




x 



Figure 3. We will end up showing that [y, z] "dips back" to within 
bounded distance of the basepoint and thus has length nearly 2r. 



Theorem 41 (Teichmiiller distance estimate). Fix any < S < 1. There exists 
a constant H > such that for all sufficiently large Ra,d the following holds: If 
71,72 are geodesic rays based at x € T{S) which satisfy 

(a) Th^ [x, 7,(r)] > S for all r > i?o and i — 1,2, and 

(b) for all ti,t 2 > Rq we have ds{ji(ti), 72^2)) > d. 
then they also must satisfy 

drdiin), 72(^2)) > n + r 2 - HR a 

for all r± , r-i . 

Proof. We fix some constants: 

• Let K > 1 and C > be large enough to be quasi-isometry constants in 
the distance formula ([TJ) for threshold 10 M, and such that the projection of 
every geodesic in T(S) to C(S), concatenated with a geodesic segment of 
length at most 5, is an unparameterized (K, C)-quasigeodesic in C(S); 

• Let t be is a constant such that (K, C)-quasigeodesic quadrilaterals in the 
curve complex are r-thin; 

• Let p,Ri be the constants corresponding to 5 from Proposition [35] 

• We furthermore require that i?o > max(i?i,4 + C) and d > 10 + r. The 
desired constant H will not depend on Rq or d. 

Now write y = 71 (n), y = Ji(Ro), z = 72(72), z o = 72(^0), and let 



n = {VCS\T v n{y ,zo]^®} 
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be the set of subsurfaces which are thin somewhere along the geodesic segment 
[yo,zo]. For each V £ f2 it is possible that it determines a nonempty thin interval 
Ty which intersects 7i[i?o,oo)- We define 

ti = sup (J Ty nji[R ,oo). 
ven 

In other words, ti £ [Rq, oo) is the smallest time such that for any V £ and for 
s > t.^ 7i(s) ^ Ty. It follows that for each i = 1, 2, there exists some Vi G f2 such 
that the Bers marking at 7i(ij) contains 9Vi. 

By definition of Vj € f2, there exists a point Wi £ [yo, zq] such that V% is thin at 
Wi] in particular ds(wi, 7i(ij)) ^ 4. Since itij is at most i?o away from an endpoint 
of [yo, zo], the triangle inequality implies that dj-(x, Wi) < 2i?o- We then have that 

d s (x, 7i(*i)) < 4 + d 5 (x, Wi) < 4 + 2KR + C < 3KR , 

where the middle inequality is an application of the distance formula ((T|) and last 
inequality holds since Rq > 4 + C. Furthermore, since ti > Ro, our hypotheses 
imply that 7$ spends at least SU time during [0,£<] in 7I . Since i?o > Rii by 
Proposition [38] we have that 

U < -ds{x,7i(ti)) < — Ro- 
P P 

Let L — 3K p g ° , and note that L > t\,t%. If the geodesic segment ji[ti,L] were 
completely contained in the thin part, then the fraction of [0,L] that 7$ spends in 
T eo would be at most 

ti 3KRq 
L < — =S - 

As this is not the case, there must exist a time t\ £ [U, L] at which 7i(i£) £ 7^ . Let 
y' and 2/ equal Ti f° r * ~ 1, 2, respectively. Note that in particular 

(12) dT ^y') jdr ^ Z ')<^p.. 

By the above estimate, the theorem is trivially true if either r, < t\ if we choose 
H > ^j, so it is enough to prove the theorem for both > ij. That is, [x,y'\ C 
[x, y] and [x, z'] C [ar, z]. Now let 

^1 = {W c S I <My', y) > 10M}, fi 2 = {W C 5 I <Mz', 2) > 10M}. 

Note that a marking projects to the curve complex as a set with diameter 2. Also 
if W\ £ Oi and W2 £ ^2 were disjoint or nested, then their boundaries would have 
distance < 1 in the curve complex, so markings where those boundaries are short 
would have distance < 5. Thus in light of assumption (b) and since d > 10, we 
have that W\ iti W2 for all W\ £ f2i and W2 £ The same argument shows that 
W2 £ cannot determine an thin interval along [y',y] and similarly for W± along 
[z', z]. In particular £li n VI2 = 0- 

Suppose V is any subsurface which is thin somewhere along [y',y] (for example, 
if V £ fii). Since its thin interval Ty along [x, y] is connected and y' is thick, we see 
that V is not thin along [yo,y']. Furthermore, the definition of t j implies that V is 
not thin along [yo, z ], and assumption (b) plus the choice of d shows that V cannot 
be thin along [2:0,2/]. Therefore, the triangle inequality gives dy(y',z') < 3M for 
these V. Letting 

fi' = {Z c s I d z (y',z') > 10M}, 
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we conclude that each surface Z € 0' cannot be thin along [y',y] (or [z',z]). In 
particular, the three collections Qi, $72, and Q' are pairwise disjoint. 
Next we establish that for any W £ fii U £^2 U $7', 



This is because we know that for each W £ fii, W is not thin along [z', z] and that 
dw(y', z') < 3M as above. Since dw{y',y) > 10M, the triangle inequality gives 
dv^(y, > 10M — 3M — M = 6M. The same argument applies to ^2- Similarly, for 
each Z £ Q', we have both d z {y',z') > 10M and d z {y' ,y), d z (z' , z) < M, so that 
z) > 8M. We conclude that each W £ f2i U f22 U f2' determines a nonempty 
thin interval T^y along [y, z\. 

Let p € [y, z] denote the last point (in traveling from y to z) at which any 
subsurface in f^i is thin and let W± £ Jli denote the corresponding subsurface. 
Analogously define q and W 2 £ fl 2 on [z,y]; p' and € Oi on [y, J/']; and q' and 
Z2 € il2 on [z,z']. (If Qj = then we take the initial endpoints of the respective 
intervals and leave the associated subsurfaces undefined.) Thus the Bers markings 
at p, q,p', q' contain dW\, dW 2 , dZi, dZ 2 , respectively. 

It cannot be the case that W\ ft) Z\ , as this would violate the time-order principle 
(Wi occurs after Z\ on [y, z] but before Z\ on [y, y']). Therefore dW\ and dZ\ either 
coincide or are disjoint; in either case we have ds(p,p') < 5. The same argument 
shows that ds(q,q') < 5. 

Next we want to verify that p occurs before q along [y, z]. This is another 
application of time ordering: suppose for contradiction that Tw 2 appeared before 
7Wi along [y,z]. (Recall that W\ ftl W2, since this was verified for all subsurfaces 
from f^i and ^2 above.) Then since dw x (y> z), dw 2 (y, z), dw x (y'i y) are all more than 
3M, Lemma [TT1 ensures that T\v 2 also appears along [y, y']. But this contradicts 
the fact that no surface in f2 2 has a thin interval along [y, y'\. 

The setup is now complete, and we are ready to prove the following: 

Claim 42. There exists a constant A > (independent of r 1; r 2 , Rq, d) such that 
the minimal distance between the segments [y,z] and [y',z'] is at most XRq. 

It is easy to see that the theorem follows from this claim. Suppose that the 
geodesic segment [y,z] comes within distance XRq of the geodesic \y',z']. As this 
latter segment is contained in the ball of radius 2L = ^-Ro around x, this means 
that there must be some point w on [y, z] whose distance from x satisfies df{w, x) < 
^jRq + XRq- Since y and z are on the spheres of radius ri,r 2 centered at x, this 
proves that 



Proof of claim. Consider any two points w' £ [y', z'\ and w £ [p, q]. Aiming to use 
the distance formula ([1]), we bound the projections dv(w, w') in terms of dy(y' ', z') 
for all proper subsurfaces V C S. 

As usual, it will require special care to deal with the case when V is an annulus 
for which the reverse triangle inequality fails. Let T denote the set of annular 
subsurfaces V for which w £ Ty and Lemma IT21 fails for V along [y,z]. Let T' 



d w (y,z) > 6M. 



d T (y, z) = d r (y, w) + d r (w, z) > r x + r 2 - 2 — + A ) Rq, 




establishing the theorem with H — 2 i^j + X 
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denote the analogous set for w' and [$/', z'] . Notice that, since at most 3(7 — 3 curves 
can be short at to or to', we have |T|, |T'| < 3g — 3. 

Below, it will be essential to bound the numbers G?y(y', to'), dy(w ', z'), dy(y, to), 
and dy(w, z). First, suppose that V ^ T because w <f Ty. Then, by the connected- 
ness of Ty, we have either dy(y, to) < M or dy(w, z) < M; whence, by the triangle 
inequality, we may conclude that 

dv(y,w),dv(w,z) < d v (y,z) + M. 

On the other hand, the reverse triangle inequality implies that the same bounds 
hold for every other V $ T as well. For all V $ T' we have the analogous bounds 

d v {y ,w'),d v {w' , z') < dv(y',z') + M. 

We now proceed to bound dv{w' ,w) for all proper subsurfaces. Firstly, any 
V € fii is not thin along [to, z] or [z, z']; therefore, 

d v {w',w) < d v {w', z') + dv(z',z) + d v {z.w) < d v (w',z') + 2M 
Similarly, any V € is neither thin on [y',y] nor [y, w], so we have 

d v (w',w) < dv(w',y') + d v {y' ) y)+d v (y,w) < d v (w',y') + 2M. 

In either of the above cases, these bounds reduce to 

d v (w',w) < d v (y', z') + 3M 

provided that V ^ T'. If V is in T', applying Theorem [13] with constant 35M, there 
exists B' such that either 

d v (y', w'),dv{w', z') < d v {y', to') + d v {w', z') < d v {y', z') + B' , 

or there is a collection {Uj} of subsurfaces that do satisfy Lemma [T2l along [y 1 , z'], 
are disjoint from V, each satisfy d\j j (y' , to') > 35M, and such that 

dv (y', w') < J2 d Vj (y', w') < Ysidu, (y', z') + B) < ^ 2d Vj (y' , z'), 

U-j Uj Uj 

< 2 [dY(y\ z% m < 2Kd T (y' 1 z') + 2C, 

YCS 

where the second inequality above follows from Lemma[l2j and the third and fourth 
from the fact that d V] (y',z') > djj^y' ,w') - B > 34 M > B. Replacing y' with z', 
we obtain the same bound on c?y(z',to') as well. (Notice that these bounds on 
dv(y',w') and dy(z',to') do not use the assumption V £ £!i U il2-) To summarize, 
for all V £ Oi U ^2, we have that 



dv(w' , to) < 



d v (y',z')+3M, if V $ V 

d v (y', z') + 3M + B' + 2Kd r (y', z') + 2C, if V e T'. 



Lastly, for a proper subsurface V ^ £li U O2 (such as V £ f2'), we have 

d v (w',w) < d v {w' ,y') + d v (y' ,y) + d v (y,w) < d v {y' ', to') + 10M + d v {y,w). 

The same holds with y, y' replaced with z, z' . As above, the dy (to', y') (or dy (to', z')) 
term may be bounded by either 

d v (y',z') + M or d v (y', z') + M + B 1 + 2Kd T (y', z') + 2C 
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depending on whether or not V £ T'. If V £ T, the dv(y, w) (or dv(w, z)) term is 
bounded similarly: 

d v {y, w), d v (w : z) < d v (y, z) + M < d v (y', z') + 21M. 

When V £ Y, we instead bound the dv(y,w) (or dv(w, z)) term as follows: By 
Theorem [T3l (applied with constants 35M and £?'), we either have 

d v (y, w) 7 d v (w, z) < d v (y, w) + dv(w, z) < d v (y 1 z) + B' 

<d v (y',z') + 20M + B', 

or there exists a collection {Uj} of subsurfaces that are disjoint from V, satisfy 
Lemma [T2l and d\j j (y, w) > 35M, and such that 

dv(y,w) < ^2d Vj (y,w). 

Lemma IT2l implies that du j (y, z) > 34 M so that each Uj has large projections along 
both [y, w] and \y,z\. The only such subsurfaces are contained in Q\ U 57' (since 
W ^ Oi U ft' U VL2 implies dw(y, z) < 30M). Running the argument with y replaced 
by z, we obtain another such collection contained in U fl' . Since all of these 
subsurfaces are disjoint from V , the fact that elements of f2i and are far apart 
in C(S) implies that one of these collections must, in fact, be contained in il'. 
Therefore we may assume that {Uj} C fi'. We then have 

du 3 (y, w) < d Uo (z, y) + B< d Vj (y', z') + 20M + B < 3d Vj (y', z'), 

where the last inequality holds because du^y 1 \z') > 14M. Thus we conclude that 

d v (y,w) < Y,du 3 {y,w) < 3 53 [d Y (y',z% m < 3Kd r (y',z') + 3C. 

Uj YCS 

Putting these estimates together, we see that for all V ^ fii U ^2 we have 

'2d v (y',z') + 32M, ifV^^T'uT 
2d v (y', z') + 32M + 2B' + 5Kd T (y', z') + 5C, if V eViiT. 



dv(w' ,w) < 



In fact, comparing with the above estimates for W 6 fli U f!,2, we see that these 
bounds hold for all proper subsurfaces V C S. 
Observe that for any threshold N > we have 

[A + B] N < 2[A] N/2 + 2[B] N/2 and [2A] N = 2[A] N/2 
for all A,B>0. Set a threshold N = 200M + 10B' + 25C*. Then, for all V C 5, 
[d v (w, w')] N < [2d v (y\ z') + 32M] W < 2 [2d v (y', z')] N/2 + 2 [32M] JV/2 
= 4[d v (y',z')] N/4 < 8[d v (y',z')] N/s 
provided that V \t T' U T. We similarly have 
[d v (w, w')] N < [2d v (y', z') + 32M + 5Kd T {y', z) + 5C] N 

< 4 [2d v (y', z')] N/4 + 4 [32M] Ar/4 + [5Kd T (y\ z')] N/4 + 4 [2B 1 + 5C] N/4 
<8[d v {y',z')] N/s + 5Kd r (y',z') 

when V G T' U T. Moreover, recall from ([12]) that d T (y',z') < ^fRo- Letting K' 
and C be sufficiently large quasi-isometry constants such that the distance formula 
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{I} holds for the thresholds N and N/8 (note that K', C are independent of Rq), 
we conclude that 



Thus we have bounded the Teichmuller distance across the green quadrilateral in 
Figure [3] linearly in terms of the corresponding curve complex distance and Rq. We 
will now use the fact that quasigeodesic quadrilaterals in the curve complex are thin. 
Let 7r be the projection map to the curve complex. On the left, we let 7l be the 
concatenation of the quasi-geodesic segment n[y',p'] with the geodesic [n(p'),n(p)]; 
likewise, on the right, 7^ is the concatenation of n[z',q'] and {ir(q'),Tr(q)]. These 
are actually (K, C)-quasigeodesics because the second segment has bounded length: 
ds(p,p'),ds(q,q') < 5 (recall the definition of K and C). Thus the quasi-geodesic 
segments 7^, ir[p, q], jr, and ir[y',z'] form a (K, C)-quasigeodesic quadrilateral, 
and we conclude that each side is contained in the r-ncighborhood of the union of 
the other three sides. 

Now, the separation hypothesis (b) of the theorem implies that no point on 
Tr[y',p'] is within d of any point on tt[z' ,q'\. Therefore, no point on ji, is within 
(d — 10) of any point on 7^. Recall that g? — 10 > r, which implies that no point on 
7l is contained in the r-neighborhood of 7^. This means that there exist points 
on 7r[p, q] and Tr[y', z'] that are within 2r + 1 of each other (since each point on 7^, 
say, is within r of one or the other, and the points of jl are separated by one). By 
definition of projection distance, these correspond to points w £ [p, q],w' € [y',z'] 
with ds{w, w') < 2t + 1. Combining this with our above estimates and using the 
fact Rq > 1, we find that 



We can now assemble our results to prove Theorems Q] and [2j Our distance 
estimates deal with annuli of fixed width. 

Theorem 43. Fix a basepoint x G T{S) and let fi be any measure on T(S) sat- 
isfying a thickness estimate and an exponential decay estimate. Fix an arbitrary 
k > 0. Then 




< K'd s {w, to') + K'(8K' + 30gK)—R + 9K'C 




This completes the proof of Claim and Theorem @T] 



□ 
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We have shown that these hypotheses are satisfied by the standard visual mea- 
sures Vis(iAr) and Vis(s x ) (Proposition l24l Theorem Wf\ . as well as the holonomy 
measure m (Theorem [^51 Theorem 125)) . and therefore the Hausdorff measure n, the 
symplectic measure n, and the Finsler measures considered above. 

Proof. Set S — 1 /2 and let A, H be the constants guaranteed by Theorems 00] and 
I4T1 Choose constants e > 0, a > 0, and Mo > 0. After choosing a sufficiently large 
thick part T tQ and restricting to a subset E r C A^.(x) x A*(x) whose complement 
has proportional measure at most e (for sufficiently large r), we may suppose that 
all pairs (3/1,2/2) S E r satisfy 

Thf [x,ji(t)}>6 and d T {li(t) ll2 {t)) > M a 

for all t > or, where 74 is the geodesic ray based at x through yi. Notice that, in 
this case, the point 71 (t) cannot be within M /2 of any point on the geodesic ray 
72 (and similarly for 72 (t) and 71). Therefore, since M may be chosen arbitrarily 
large, for each d > 6 Theorem FfUl guarantees that d$ (71 (ii), 72(^2)) > d for all 
r» 1 and t\,t2 > Aor. Choosing d to be sufficiently large, we may then apply 
Theorem [4J] to conclude that for all r ^ 1 and all (2/1,2/2) G -Ey we have 

dr(li( r i),l2(r2)) >ri+r 2 - HXar 

for all ri,r2- Putting the above estimates together, we find that 

1 1 f 
liminf 7 Aki \\2 / d T {y,z)dfi{y)dn(z) 

> liminf -(1 - e)(2r - 2fe - iJAcrr) = (1 - e)(2 - iJAa). 

Since e and tr can be chosen arbitrarily small, the result follows. □ 

We immediately deduce the same result for balls if the measure has definite 
exponential growth. 

Theorem 1. Fix a basepoint x and let fi be any measure satisfying a thickness 
estimate and an exponential decay estimate, for which there is definite exponential 
growth C ie ar < n(B r (x)) < C 2 e hr . Then 

1 1 f 

lim in 1 \\2 \ d T~(y> z ) d/j(y)dn(z) = 2. 

r^oo r fJ,(B r (x)y JB r {x)xB r (x) 

The growth condition, verified for our measures in Corollary 1201 ensures that 
most of the ball is concentrated in the annulus (Lemma l2Tj). 

Theorem 2. Fix x G T(S). For either of the standard visual measures fi x = 
Vis r (u x ) or Vis r (s x ) on the Teichmuller sphere S r (x) of radius r, we have that 

E(T(S),t* x ) = 2. 

Proof. Visual measures on T(S) are constructed by radially integrating these visual 
measures on spheres; in fact, Proposition [24l and Theorem [27] from Sj5]were proved 
for annuli by first verifying them for spheres. Thus we have both a thickness 
estimate and an exponential decay estimate for visual measures on spheres. The 
result follows. □ 
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